Noise robust speech recognition using F0 contour extracted by hough transform
نویسندگان
چکیده
This paper proposes a noise robust speech recognition method using prosodic information. In Japanese, fundamental frequency (F0) contour represents phrase intonation and word accent information. Consequently, it conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using Hough transform, which achieves high extraction rates under various noise environments. Then it proposes a robust speech recognition method using syllable HMMs which model both segmental spectral features and F0 contours. Speaker-independent experiments are conducted using connected digits uttered by 11 male speakers in various kinds of noise and SNR conditions. The recognition accuracy is improved in all noise conditions, and the best absolute improvement of digit accuracy is about 4.7%. This improvement is achieved due to the more precise digit boundary detection by the robust prosodic information.
منابع مشابه
Noise Robust Speech Recognitio Extracted by Hough Tr
This paper proposes a noise robust speech recognition method using prosodic information. In Japanese, fundamental frequency (F0) contour represents phrase intonation and word accent information. Consequently, it conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using Hough transform, which achieves high extraction rates ...
متن کاملNoise Robust Speech Recognition Using Prosodic Information
This paper proposes a noise robust speech recognition method for Japanese utterances using prosodic information. In Japanese, the fundamental frequency (F0) contour conveys phrase intonation and word accent information. Consequently, it also conveys information about prosodic phrase and word boundaries. This paper first proposes a noise robust F0 extraction method using the Hough transform, whi...
متن کاملNoise robust speech recognition using spectral subtraction and F0 information extracted by Hough transform
We propose a noise robust speech recognition method based on combining novel features extracted from fundamental frequency (F0) information and spectral subtraction. F0 features have been shown to be effective in speech recognition in noisy environments. Recently, F0 features obtained by Hough transform were developed for concatenated digit recognition and significantly improved recognition per...
متن کاملNoise-robust speaker verification using F0 features
This paper proposes a noise-robust speaker verification method augmented by fundamental frequency (F0). The paper first describes a noise-robust F0 extraction method using the Hough transform. Then, it proposes a robust speaker verification method using multi-stream HMMs which fuse the extracted F0 and cepstral features. Experiments are conducted using fourconnected-digit utterances of Japanese...
متن کاملAn Empirical Study of Tone Models for Robust Speech Recognition of Isolated Thai Words
Robust speech recognition system is necessary in real environment, whereas tone in Thai speech recognition system is an importance as well. Therefore, we attempt to find tone model for enhancing Thai speech recognition system in noisy environment. In this paper we present an empirical study by configuration fundamental frequency contour, which one of prosodic information for tone language. Fund...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002